Discriminative Training for HMM-Based Offline Handwritten Character Recognition

نویسندگان

  • Roongroj Nopsuwanchai
  • Daniel Povey
چکیده

In this paper we report the use of discriminative training and other techniques to improve performance in a HMMbased isolated handwritten character recognition system. The discriminative training is Maximum Mutual Information (MMI) training; we also improve results by using composite images which are the concatenation of the raw images, rotated and polar transformed versions of them; and we describe a technique called block-based Principal Component Analysis (PCA). For effective discriminative training we need to increase the size of our training database, which we do by eroding and dilating the images to give a threefold increase in training data. Although these techniques are tested using isolated Thai characters, both MMI and block-based PCA are applicable to the more difficult task of cursive handwriting recognition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

RWTH OCR: A Large Vocabulary Optical Character Recognition System for Arabic Scripts

We present a novel large vocabulary OCR system, which implements a 5 confidenceand margin-based discriminative training approach for model adap6 tation of an HMM based recognition system to handle multiple fonts, different 7 handwriting styles, and their variations. Most current HMM approaches are HTK 8 based systems which are maximum-likelihood (ML) trained and which try to adapt 9 their model...

متن کامل

Model-based online handwritten digit recognition

This paper presents a hidden Markov model (HMM) based approach to on-line handwritten digit recognition using stroke sequences. In this approach, a character instance is represented by a sequence of symbolic strokes, and the representation is obtained by component segmentation and stroke classification. The component segmentation is based on the delta lognormal model of handwriting generation. ...

متن کامل

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

متن کامل

Offline handwritten word recognition using a hybrid neural network and hidden Markov model

This paper describes an approach to combine neural network (NN) and Hidden Markov models (HMM) for solving handwritten word recognition problem. The preprocessing involves generating a segmentation graph that describes all possible ways to segment a word into letters. To recognize a word, the NN computes the observation probabilities for each letter hypothesis in the segmentation graph. The HMM...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003